Searching Proper Names in Databases
نویسندگان
چکیده
Identifying names — e.g., author names or company names — is still an open problem. In this paper we review known similarity measures. These measures deal with phonetic similarity, typing errors and plain string similarity. We show experimentally that all three approaches lead to significant better retrieval quality than plain identity. Furthermore, we demonstrate that combinations of different similarity measures perform even better than any single technique.
منابع مشابه
Discovering Hidden Analogies in an Online Humanities Database
VOLUMINOUS DATABASES CONTAIN HIDDEN KNowLmm-i.e., literatures that are logically but not bibliographically linked. Unlinked literatures containing academically interesting commonalities cannot be retrieved via normal searching methods. Extracting hidden knowledge from humanities databases is especially problematic because the literature, written in “everyday” rather than technical language, lac...
متن کاملName Searching and Information Retrieval
The main application of name searching has b e ~ name matching in a database of names. This paper discusses a different application: improving information retrieval through name recognition. It investigates name recognition accuracy, and the effect on retrieval performance of indexing and searching personal names differently from non-name terms in the context of ranked retrieval. The main concl...
متن کاملMatchsimile: a Flexible Approximate Matching Tool for Searching Proper Name
We present the architecture and algorithms behind Matchsimile, an approximate string matching lookup tool especially designed for extracting person and company names from large texts. Part of a larger information extraction environment, this specific engine receives a large set of proper names to search for, a text to search, and search options; and outputs all the occurrences of the names foun...
متن کاملارزیابی رابطهای جست و جو در پایگاههای پزشکی مبتنی بر شواهد
Introduction: The existence of proper search interfaces in evidence based medicine databases will lead to quick access to evidence based medicine. Given The necessity and their importance, the aim of this study is an evaluation of search interfaces in evidence based medicine databases. Methods: This study was an applied research, Carried through survey method. The study population was 12 evide...
متن کاملProper Name Translation in Cross-Language Information Retrieval
Recently, language barrier becomes the major problem for people to search, retrieve, and understand WWW documents in different languages. This paper deals with query translation issue in cross-language information retrieval, proper names in particular. Models for name identification, name translation and name searching are presented. The recall rates and the precision rates for the identificati...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1995